Survey of Distributed Computing Frameworks for Supporting Big Data Analysis
نویسندگان
چکیده
Distributed computing frameworks are the fundamental component of distributed systems. They provide an essential way to support efficient processing big data on clusters or cloud. The size increases at a pace that is faster than increase in capacity clusters. Thus, based MapReduce model not adequate analysis tasks which often require running complex analytical algorithms extremely sets terabytes. In performing such tasks, these face three challenges: computational inefficiency due high I/O and communication costs, non-scalability memory limit, limited because many serial cannot be implemented programming model. New need developed conquer challenges. this paper, we review MapReduce-type currently used handling discuss their problems when conducting analysis. addition, present non-MapReduce framework has potential overcome
منابع مشابه
Distributed Data Processing Frameworks for Big Graph Data
Recently we create so much data (2.5 quintillion bytes every day) that 90% of the data in the world today has been created in the last two years alone [1]. This data comes from sensors used to gather traffic or climate information, posts to social media sites, photos, videos, emails, purchase transaction records, call logs of cellular networks, etc. This data is big data. In this report, we fir...
متن کاملAdvanced Visual Interfaces Supporting Distributed Cloud-Based Big Data Analysis
Handling the complexity of relevant data requires new techniques with regard to data access, visualization, perception, and interaction for innovative and successful strategies. As a response to increased graphics performance in computing technologies and Information Visualization, Card et al. developed the Information Visualization Reference Model. Due to further developments in Information Sy...
متن کاملA Survey of Statistical Methods and Computing for Big Data
Big data are data on a massive scale in terms of volume, intensity, and complexity that exceed the capacity of standard software tools. They present opportunities as well as challenges to statisticians. The role of computational statisticians in scientific discovery from big data analyses has been under-recognized even by peer statisticians. This article reviews recent methodological and softwa...
متن کاملSecurity Methods for Privacy Preserving and Data Sharing Over Cloud Computing and Big Data Frameworks
The cloud computing is one of the widely used services for resource management by many IT (information technology) and non-IT organizations due to its different benefits in terms of time saving and cost savings to the companies. Such cloud computing frameworks are used to store the small to big data efficiently. Most of companies want to store huge amount of data and hence along with cloud comp...
متن کاملBig Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks
The term ‘Big Data’ has spread rapidly in the framework of Data Mining and Business Intelligence. This new scenario can be defined by means of those problems that cannot be effectively or efficiently addressed using the standard computing resources that we currently have. We must emphasize that Big Data does not just imply large volumes of data but also the necessity for scalability, i.e., to e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Big data mining and analytics
سال: 2023
ISSN: ['2096-0654']
DOI: https://doi.org/10.26599/bdma.2022.9020014